Semantic MapNet: Building Allocentric Semantic Maps and Representations from Egocentric Views
نویسندگان
چکیده
We study the task of semantic mapping – specifically, an embodied agent (a robot or egocentric AI assistant) is given a tour new environment and asked to build allocentric top-down map (‘what where?’) from observations RGB-D camera with known pose (via localization sensors). Importantly, our goal neural episodic memories spatio-semantic representations 3D spaces that enable easily learn subsequent tasks in same space navigating objects seen during (‘Find chair’) answering questions about (‘How many chairs did you see house?’). Towards this goal, we present Semantic MapNet (SMNet), which consists of: (1) Egocentric Visual Encoder encodes each frame, (2) Feature Projector projects features appropriate locations on floor-plan, (3) Spatial Memory Tensor size floor-plan length×width×feature-dims learns accumulate projected features, (4) Map Decoder uses memory tensor produce maps. SMNet combines strengths (known) projective geometry representation learning. On Matterport3D dataset, significantly outperforms competitive baselines by 4.01?16.81% (absolute) mean-IoU 3.81?19.69% Boundary-F1 metrics. Moreover, show how use for ObjectNav Embodied Question Answering. Project page: https://vincentcartillier.github.io/smnet.html.
منابع مشابه
Egocentric and allocentric representations in auditory cortex
A key function of the brain is to provide a stable representation of an object's location in the world. In hearing, sound azimuth and elevation are encoded by neurons throughout the auditory system, and auditory cortex is necessary for sound localization. However, the coordinate frame in which neurons represent sound space remains undefined: classical spatial receptive fields in head-fixed subj...
متن کاملAllocentric and Egocentric Spatial Representations: Definitions, Distinctions, and Interconnections
Although the literatures on human spatial cognition and animal navigation often make distinctions between egocentric and allocentric (also called exocentric or geocentric) representations, the terms have not generally been well defined. This chapter begins by making formal distinctions between three kinds of representations: allocentric locational, egocentric locational, and allocentric heading...
متن کاملSimilarity Semantics and Building Probabilistic Semantic Maps from Parallel Texts
This paper deals with statistical (non-implicational) semantic maps, built automatically using classical multidimensional scaling from a direct comparison of parallel text data (the Gospel according to Mark) in the domain of motion events (case/adpositions) in 153 languages from all continents in 190 parallel clauses. The practical objective of is to present one way (among other possible ways) ...
متن کاملBuilding Semantic Annotated Maps by Mobile Robots
The work presented here explains a framework to build semantic annotated maps from laser range measurements of a mobile robot. A hand-crafted and a learning classifier is explained. Two alternative methods to aggregate the resulting class membership vectors into a grid map have been developed and are presented. Both alternative methods will be motivated and described in detail and discussed cri...
متن کاملFrom Feature Maps to Semantic Landscapes: Negotiating the Semantic Gap
In this paper, we present the results of a project that seeks to transform low-level features to a higher level of meaning. This project concerns a technique, latent semantic indexing (LSI), in conjunction with normalization and term weighting, which have been used for full-text retrieval for many years. In this environment, LSI determines clusters of co-occurring keywords, sometimes, called co...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2021
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v35i2.16180